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ABSTRACT 


The  purpose  of  our  investigation  was  to  determine  if  computer-based  selection  tests  could  predict  training 
track  assignment  for  student  naval  aviators.  This  study  evaluated  the  predictive  efficacy  of  an  experimental 
battery  of  computer-based  pilot  selection  tests  for  training  classification.  Student  naval  aviators  are  currently 
assigned  to  an  aircraft  training  track  based  primarily  on  performance  in  primary  training.  Students  were  tested 
on  the  experimental  test  battery  and  classified  into  one  of  three  aircraft  training  tracks  based  on  their  test  scores, 
The  resulting  classifications  were  compared  to  actual  selections  made  as  the  students  progressed  through  naval 
aviation  training.  Using  a  sample  of  237  students,  linear  analyses  were  conducted  to  evaluate  the  efficacy  of 
predicted  decisions.  The  unique  contribution  of  the  experimental  battery  was  determined  by  comparing  scores 
on  the  experimental  battery  to  scores  on  the  Navy/Marine  Corps  Aviation  Selection  Test  Battery,  a  paper-and- 
pencil  pilot  selection  test  used  by  the  United  States  Navy  and  Marine  Corps,  and  student  primary  flight  training 
grades.  A  significant  classification  model  including  one  of  the  experimental  selection  tests  was  derived.  The 
model  was  able  to  significantly  predict  fast  attack  pipeline  selections  before  flight  training. 
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INTRODUCTION 


Psychological  testing  has  proven  to  be  an  integral  component  in  the  medical  screening  of  potential  aviators. 
Performance  in  flight  training  has  been  statistically  linked  to  performance  on  a  broad  range  of  these  tests  (see 
Burke,  1993).  Research  has  grouped  these  tests  into  five  broad  areas:  1)  general  cognitive  ability,  2) 
information  processing  ability,  3)  psychomotor  coordination,  4)  personality  traits,  and  5)  background  (Burke, 
1993;  Street,  Helton,  <&  Dolgin,  1992).  General  cognitive  ability  has  been  the  most  v/idely  tested  domain  in 
actual  pilot  selection  (Burke,  1993).  For  example,  until  1993  the  U.S.  Navy  (USN)  and  Marine  Corps  based 
the  selection  of  physically  qualified  pilot  candidates  on  the  paper-and-pencil  Academic  Qualification  Test  (AQT), 
a  test  of  flight-related  academic  abilities,  and  the  Flight  Aptitude  Rating  (FAR),  an  aptitude-related  measure.  A 
revision  of  the  AQT/FAR,  the  Aviation  Selection  Test  Battery,  was  implemented  in  1993.  Similar  paper-and- 
pencil  tests,  the  U.S.  Army  (USA)  Flight  Aptitude  Selection  Test  (FAST)  and  the  U.S.  Air  Force  (USAF) 
Officer  Qualifications  Test  (AFOQT),  have  been  used  by  the  Army  and  Air  Force.  These  tests  are  generally 
economical  and  easily  administered  to  large  numbers  of  candidates.  Even  so,  general  cognitive  ability  has 
proven  to  be  of  somewhat  more  limited  value  than  other  domains  in  aviator  selection  research  (Burke,  1993). 
Specifically,  psychomotor  coordination  and  background  have  historically  accounted  for  more  variance  in 
predictions  of  flight  training  grades. 

Psychomotor  coordination  tests  have  been  the  most  robust  strategies  for  training  performance  prediction. 
These  strategies  typically  focus  on  eye-hand-foot  coordination  in  their  simplest  forms,  although  more  promising 
strategies  have  combined  such  skills  with  information  processing,  problem  solving,  and  reaction  time  in  an 
aircraft-like  environment  (Damos,  1987;  Blower  &  Dolgin,  1991).  Information  processing  tests  measure  the 
speed  and  efficiency  with  which  an  individual  makes  decisions  about  sensory  information  (i.e.,  hearing,  vision, 
and  touch)  in  an  aircrew  environment.  Of  particular  importance  ate  the  strategies  implemented  by  the  USA  and 
USAF.  The  first  operational  psychomotor  and  information-processing  selection  battery,  the  computer-based 
Basic  Attributes  Test  (BAT),  was  implemented  by  the  USAF  in  1992  to  augment  selection  decisions  formerly 
based  on  the  AFOQT.  The  USA  implemented  a  similar  computer-based  battery  to  augment  classification 
decisions  student  aviators  selected  on  the  basis  of  the  FAST.  These  batteries  and  their  contribution  to  selection 
and  classification  will  be  discussed  in  more  detail  later  ic  this  article. 

Results  have  been  mixed  for  background  and  personality  tests,  although  background  tests  are  generally 
considered  to  be  the  best  predictor  of  training  attrition  (Hilton  &  Dolgin,  1991).  Theoretically,  background 
tests  reflect  what  a  person  has  done  in  the  past.  Presumably,  tests  that  measure  a  person's  knowledge  and 
interest  in  aviation  predict  the  individual’s  ultimate  interest  in  an  aviation  career  (Street  &.  Dolgin,  1992). 
Research  with  personality  tests  has  generally  tried  to  identify  behavioral  and  emotional  characteristics  that 
improve  or  lessen  the  likelihood  of  success  in  aviation.  The  operational  contribution  of  personality  and 
background  tests  has  been  questioned  because  of  the  high  susceptibility  of  such  tests  to  faking  (Davis,  1989; 
Retzlaff  &.  Gibertini,  1987).  Nevertheless,  the  USN  AQT/FAR  and  its  revision,  the  ASTB,  include  a 
background  questionnaire  that  is  used  to  predict  early  training  attrition.  The  USA  and  USAF  no  longer  include 
background  questionnaires  in  testing  of  pilot  candidates. 

During  the  late  1980s,  Damos  (1987)  argued  that  pilot-selectiou  decisions  could  be  improved  by  testing  a 
broader  spectrum  of  the  five  major  domains.  This  idea  had  been  alluded  to  as  early  as  WWI  (see  Parsons, 
1918),  although  the  complexity  and  expense  of  such  broad-spectrum  batteries  was  prohibitive  until  the  advent  of 
high-speed  microcomputers.  Street,  Chapman,  and  Helton  (1993)  found  that  broad-spectrum  batteries  explain 
more  variance  in  training  outcome  predictions  than  narrow-spectrum  batteries  that  sample  fewer  domains. 
Recently,  there  has  been  increasing  military  interest  in  predicting  not  only  how  well  a  student  pilot  will  do  in 
training  but  also  which  airframe  would  best  suit  the  individual.  There  is  evidence  that  some  of  the  same 
psychological  tests  used  for  selection  of  military  aviators  may  also  be  valid  for  training  classification  (Intano, 
House,  &  Lofaro,  1991;  Siem  &  Alley,  1992).  The  Army,  Navy/Marine  Corps  (USN/USMC),  and  USAF 
have  each  considered  a  number  of  training  performance  and  test  battery  models  to  make  training  classification 
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decisions.  Of  the  services,  the  USA  is  the  only  branch  to  classify  student  pilots  based  on  a  training/test 
performance  model.  As  the  Department  of  Defense  faces  increasing  pressures  to  consolidate  training  resources, 
the  selection  models  used  by  the  other  services  should  also  be  investigated  for  classification  purposes. 

Historically,  the  USN  and  USAF  have  relied  on  rigorous  entry  standards  to  s*  lect  the  most  qualified  student 
aviators  ana  pilots.  For  both  the  USN  and  the  USAF,  classification  of  students  into  aircraft  training  tracks  was 
based  primarily  on  individual  student  performance  and  staffing  requirements.  The  USA  system  for  acquiring 
student  rotary  wing  aviators  is  generally  less  rigoious  and  emphasizes  classification  very  early  in  the  training 
program.  Intano  et  al.  (1991)  reported  that  the  USA  used  the  results  of  an  automated  battery  of  psychomotor, 
ability,  and  personality  tests  and  student  training  performance  to  assign  student  helicopter  pilots  into  one  of  four 
advanced  tracks  (UH-1,  AH-1,  OH-57,  or  UH-60).  The  USA  classification  test  battery,  the  Multi-Track  Test 
Battery  (MTTB),  was  a  synthesis  of  computer-based  tests  developed  by  the  USAF,  Naval  Aerospace  Medical 
Research  Laboratory  (NAMRL),  and  Army  Research  Institute  Aviation  Research  and  Development  Activity 
(ARIARDA).  Intano  and  Howse  (1992)  farther  reported  that  primary  flight  and  academic  grades  could  be 
predicted  by  various  subtests  of  the  MTTB.  We  were  not  able  to  find  any  investigations  applying  the  MTTB  to 
training  grades  in  the  advanced  tracks  (UH-i,  etc.). 

The  USAF  has  also  investigated  the  value  of  computer-based  selection  tests  for  classification.  For 
example,  Siem  and  Alley  (1992)  reported  that  modest  improvements  in  training  performance  could  be  obtained 
by  optimally  assigning  USAF  pilot  candidates  into  training  tracks.  They  based  the! :  conclusions  on  ratings  of 
200  pilot  candidate  records  by  57  male  USAF  instructor  pilots.  The  200  candidate  records  were  divided  into  4 
groups  of  50  and  rank  ordered  (from  1  to  50)  according  to  performance  on  the  USAF  AFOQT  and  the  BAT. 

A  prediction  model  based  on  the  rankings  was  then  compared  to  random  training  track  assignment.  The  results 
confirmed  an  improvement  in  predicted  performance.  They  also  found  that  information  processing  accuracy, 
resource  allocation,  and  psychomotor  coordination  were  considered  most  important  for  the  fighter  track.  In  a 
similar  study,  Siem  (1990)  investigated  whether  various  test  scores  (AFOQT  and  BAT)  and  background  data 
could  predict  assignment  to  the  USAF  fighter  training  track.  He  developed  a  significant  cross-validated 
regression  model  that  accounted  for  16%  of  the  variance  in  fighter  track  prediction  in  a  sample  of  426  USAF 
pilot  candidates.  These  results  are  consistent  with  USA  research  (especially  Intano  &  Howse,  1992)  and  naval 
research  (especially  Shull  &  Griffin,  1990). 

The  determination  of  which  track  is  appropriate  for  the  student  is  commonly  called  "pipeline  selection"  in 
naval  aviation.  The  USN  and  USMC  divide  pilot  training  into  three  primary  tracks:  1)  strike,  2)  rotary  wing, 
and  3)  maritime  patrol.  The  USN  has  one  additional  pipeline  for  the  E-2  and  C-2  aircraft.  The  strike  pipeline 
includes  fighter  and  attack  aircraft  such  as  the  F/A-18,  F-14,  A-6,  and  EA-6B  for  the  USN.  The  USMC 
includes  the  F/A-18,  EA-6B,  and  AV-8B  in  the  strike  category.  Strike  pipeline  students  are  trained  for  arrested 
carrier  landings.  The  rotary-wing  track  includes  all  helicopters  (HELO)  for  the  USN  and  USMC.  The 
maritime  pipeline  includes  other  multiple  engine  aircraft  such  as  the  C-9,  C-130,  C-12,  and  P-3.  Finally,  the 
E2-C2  pipeline  trains  for  the  E-2  and  C-2  aircraft.  This  pipeline  is  the  only  nonattack  category  trained  for 
arrested  carrier  landings  and  multiple  engine. 

Currently,  pipeline  selection  for  the  USN  and  USMC  is  based  primarily  on  student  performance  during 
primary  training.  Additional  factors  are  the  needs  of  the  USN/USMC  and  individual  student  desires.  Quotas 
for  each  pipeline  are  set  annually  by  the  USN  and  USMC.  Students  can  select  from  the  available  training 
pipelines  based  on  training  activity  grades,  flight  training  performance,  class  standing,  and  personal  preference. 
Traditionally,  the  most  desirable  pipeline  is  strike  followed  by  rotary  wing  and  maritime  patrol,  although  the 
majority  of  training  billets  are  divided  between  strike  and  rotary  wing.  As  the  most  desirable  pipelines  are 
filled,  fewer  choices  are  left  for  lesser  performers. 

Few  investigations  of  optimal  pilot  truning  assignment  have  been  conducted  in  naval  aviation.  One  (Shull 
&.  Griffin,  1990)  investigated  the  performance  of  naval  aviators  from  various  aircraft  communities  on  a  battery 
of  cogniti'  e  and  performance  tests.  In  their  study,  performance  on  a  battery  of  computer-based 
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cognitive/performance  tests  for  three  samples  of  naval  aviators  (F-14,  /V  -  66;  F/A-18,  N  =  67;  and  HELO,  <V 
»  39)  was  compared  to  the  test  performance  of  a  sample  of  student  naval  aviators  (N  =  177).  Analyses  of 
variance  results  indicated  that  strike  pilots  (F-14  and  F/A-18)  made  significantly  fewer  errors  on  a  psychomotor 
tracking  and  dichotic  listening  task  than  the  HELO  pilots. 

In  another  investigation,  Shull  (1991)  compared  the  performance  of  USMC  AV-8B,  Harrier,  pilots  to  a 
sample  of  aviation  indoctrination  students.  In  that  study,  the  testing  performance  results  of  the  sample  collected 
in  the  Shull  and  Griffin  (1990)  investigation  were  compared  to  a  sample  of  32  USMC  AV-8B  pilots.  The  AV- 
8B  is  a  jet  attack  aircraft  capable  of  vertical  takeoff  and  landing  (VTOL)  and  very  low  speed/ stationary  hover 
operation.  It  is  piloted  by  aviators  selected  from  the  jet-student  training  pipeline.  Analyses  of  variance  revealed 
few  significant  differences,  although  the  AV-8B  aviators  did  make  more  errors  on  the  psycho  motor  tracking  task 
than  the  F-14  and  ^ /A-18  aviators.  In  addition,  HELO  pilots  made  significantly  more  psycisomotor  tracking 
errors  than  any  of  the  fixed-wing  aviators  in  the  study.  No  differences  in  test  performance  were  found  for 
experience  in  any  of  the  aviator  communities.  Based  on  differences  in  the  pilot  communities,  the  authors 
suggested  that  psychomotor  coordination  may  have  been  considered  in  the  assignment  of  aviators. 

Other  researchers  have  found  that  some  aspects  of  naval  pilot  performance  after  leaving  training  could  be 
predicted  by  certain  tests  and  actual  training  performance.  For  example,  Bnclsoo.  Burger,  and  Gallagher 
(1972)  found  a  significant  relationship  between  simulated  performance  (tests  and  actual  training  performance) 
and  performance  in  initial  carrier  landing  qualifications.  Similarly,  Griffin,  Morrison,  Amenon.  and  Hamilton 
(1987)  found  that  certain  psychomotor  tracking  and  dichotic  listening  variables  correlated  significantly  with 
some  elements  of  air-combat  maneuvering  (ACM)  performance  in  a  sample  of  USMC  F-4  pilots.  Comparisons 
of  psychomotor  and  cognitive  tasks  to  performance  in  operational  settings  have  been  less  promising.  Griffin  and 
Shull  (1990)  found  no  relationship  between  automated  tests  and  performance  in  a  F/A-18  fleet  replacement 
squadron.  In  a  related  study,  no  relationship  was  found  between  ACM  performance  in  a  tactical  F-14  squadron 
(Shull  &  Griffin,  1987). 

In  each  of  these  investigations,  cognitive  and  psychomo tor  testing  has  been  used  to  predict  later  training 
track  selection  and  performance.  Only  one  investigation  found  in  the  literature  addressed  personality  testing. 
Picano  (1991)  examined  the  relationships  between  personality  type  and  aircraft  assignment  in  a  sample  of  1 70 
experienced  USA  pilots.  He  used  a  nonclinical  measure  of  personality  designed  for  use  in  occupational  settings 
The  s  were  no  differences  between  the  roughly  equally  distributed  utility,  attack,  and  observation  groups. 
However,  rated  instructor  pilots  were  significantly  more  compe'otve  tLas  nonrated  pilots.  This  difference  could 
not  be  explained,  although  similar  instructor  billets  in  the  USN  are  highly  competitive.  As  a  result  those 
individuals  that  achieve  this  status  are  likely  to  be  highly  self-selected.  Of  particular  interest  to  our 
investigation,  there  were  no  significant  personality  trait  differences  between  the  three  groups  (i.e.,  utility, 
observation,  or  attack).  Picano  (1991)  did  not  investigate  the  contribution  of  cognitive  or  psychomotor  tests. 

In  response  to  ongoing  reductions  in  operational  military  funding,  aviatioa  training  dollars  will  also 
decrease.  The  effect  of  these  reductions  has  already  been  suggested  by  the  Center  for  Naval  Analysis  and 
Chief,  Naval  Education  and  Training.  Specifically,  piograms  may  be  shortened  or  cut  to  reduce  training  costs. 
Additionally,  consolidation  of  USAF,  USN,  and  USA  pilot-training  programs  has  been  proposed  within  the 
Department  of  Defense.  Of  particular  interest  to  our  investigation  is  s  proposal  to  consolidate  rotary -wing 
training  with  one  of  the  services.  We  believe  that  certain  psychological  tests  designed  for  selection  may  be 
valid  for  determining  the  optimal  training  pipelines  for  student  aviators. 

Despite  the  lack  of  conclusive  research  findings  regarding  the  utility  of  psychological  tests  for  optimal  pilot 
training  assignment,  certain  psychological  tests  have  demonstrated  utility  in  personnel  classification.  Reviews  of 
pilot  selection  and  classification  research  (Hilton  Sc  Dolgin,  1991;  Burke.  1993)  reveal  that  a  number  of 
psychological  tests  are  valid  predictors  of  training  success.  Additionally,  such  tests  may  have  utility  for 
classification  of  student  naval  aviators  into  an  optimal  aircraft  training  assignment.  Our  study  represents  the 
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first  attempt  to  investigate  the  utility  of  a  valid  set  of  experimental  psychological  selection  tests  for  naval 
aviation  training  classification.  The  hypothesis  of  interest  was  that  these  experimental  tests  would  contribute  to 
the  prediction  of  pipeline  selection  decisions.  Given  previously  cited  findings  linking  these  experimental  and 
other  tests  to  success  in  aviation  training,  we  expected  to  find  that  a  valid,  broad-spectrum,  computer-based 
selection  system  would  nearly  approximate  the  accuracy  of  pipeline  classification  decisions  currently  based  on 
primary  training  performance  for  student  naval  aviators. 


METHOD 


SAMPLE 

The  subjects  who  participated  in  the  current  study  were  student  naval  aviators,  preselected  for  naval  flight 
training  on  the  basis  of  their  performance  on  the  AQT  FAR  The  aibjects  took  the  NAMRL  computer-based 
psychomotor  tests  (CBPTs)  while  enrolled  m  an  aviation  indoctrination  program  prior  to  entering  flight  training 
Their  participation  in  this  project  was  strictly  voluntary.  Before  administering  the  tests,  ail  subjects  were 
informed  that  their  decision  to  participate  and  their  test  results  would  not  affect  their  status  in  the  flight  program 
and  would  not  be  entered  into  their  service  record.  The  testing  was  conducted  between  1988  and  1990.  The 
subjects  were  followed  through  primary  nasal  flight  training,  and  mfonaatsoe  such  as  flight  grades,  training 
success  or  future,  and  pipeline  assignment  w as  entered  into  a  computer  data  hate  Subjects  {$  «*  237)  ranged 
from  21  to  28  yean  old  (M  *  23.06.  SD  «  i .55)  when  tested  and  included  229  males  and  8  females.  This 
sample  was  broken  down  into  three  subgroups  heed  on  pipeline  assignment  at  the  end  of  primary  flight 
training  I)  jet  ( S  -  75).  2)  HELO  i &  ®  91).  3)  patrol  (S  -  7i).  Only  subjects  who  had  been  assigned  to  a 
pipeline  and  succeeded  ua  flight  training  were  included  ta  the  analyses. 

PROCEDURE 

Volunteer  student  navel  aviators  were  adnunaslcred  a  Natters  of  experimental  CBPTs  after  selection  and 
before  catering  flight  framing  We  tracked  undent  progress  through  primary  flight  train  tag  and  compared 
various  CBPT  and  AQT  FAR  test  kotos  to  their  aircraft  training  track  assignments.  The  primary  flight  training 
syllabus  take*  approximately  6  months  depending  on  student  progress.  The  primary  syllabus  includes  cn 
extensive  ground  school  and  static  sunuiatacn  before  students  ester  the  cutkpst  Primary  flight  craning  is 
conducted  ta  the  Beechcraff  T  MC  turbo-mentor  The  T-MC  ts  a  turbo-prop  instrument -rated  fixed-wing 
aircraft  Dtctuoas  regarding  aircraft  tranmg  assignment  ace  made  during  the  final  stages  of  primary  training 
and  are  based  primarily  on  malm*  icademac  and  flight  Turin j  trades  During  ike  period  of  samp It  collection, 
students  were  ranked  and  offered  as  assignment  ranging  from  yet  for  superior  performers  to  HE  10 


The  AQT  FAR  is  a  paper -and -pencil  aptitude  lest  battery  that  includes  five  different  subtests  that  are  divided 
into  two  ptkrt  selection  composite*.  a  general  ability  ntimtlr  {AQT)  md  an  -■im*1*  of  aptitude  for  flying 
(FAR)  The  AQT  FAR  uses  a  tuned,  multiple -caosce  format.  The  first  part,  the  AQT.  is  made  up  of  reading, 
arithmetic,  and  science  <pac*uo**  that  are  related  to  a  typacnl  college  experience  The  second  part,  the  FAR.  is 
comprised  of  three  subaests  l )  the  Mechanatai  Comprehension  Test  (MCT).  2)  the  Spatial  Apperception  test 
(SAT),  and  3)  the  Baognphicai  Inventory  (BI).  The  MCT  is  a  test  of  rensoomg  and  comprehension 

that  uses  pictures  and  word  problems.  The  SAT  is  a  test  of  spatial  reanoniag  that  uses  pictures  of  terrain  as 
they  would  he  viewed  ‘rota  various  cockpu  pitch -and-roll  configurations  The  final  component,  the  BI.  is  a  test 
of  attitudes  and  interests  related  to  a  ashlars  and  or  aviation  career  We  were  primarily  interested  in  the  AQT 
and  FAR  stanme  composite  kotos,  although  we  also  included  the  SAT  and  MCT  subtest  kotos  in  our  analyses. 
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We  were  also  interested  in  one  CBPT  developed  by  NAMRL  (see  Blower  &  Dolgin,  1991).  Research  has 
identified  that  the  Psychomotor  Test/  Dichotic  Listening  Test  (PMT/DLT)  is  the  most  powerful  of  the  NAMRL 
CBPTs  (Delaney,  1992;  Street,  Chapman,  &  Helton,  1993).  It  comprises  seven  subtests  that  measure  eye-hand- 
foot  coordination,  divided  attention,  and  selective  attention.  An  Apple  He  version  of  the  NAMRL  PMT/DLT 
was  used  in  the  current  investigation.  Additional  testing  station  components  included  an  Amdek  Color  I  Plus 
Monitor,  an  Apple  He  numeric  keypad,  sound  synthesizer  card,  locally  produced  rudder  pedals,  and  two  high- 
fidelity  joysticks.  The  PMT  component  involves  stick  (S),  rudder  pedal  (R),  and  throttle  (T)  controls  that  move 
different  computer-generated  cursors.  Variables  derived  from  the  PMT  were  logarithmically  transformed  pixel 
error  composite  scores.  The  DLT  was  designed  to  measure  individual  differences  in  selective  attention  to 
different  digit  and  letter  sequences  presented  to  each  ea/  simultaneously.  The  DLT  variables  were  error  scores. 
A  single  logarithmic  composite  score  was  derived  for  each  of  the  seven  subtests.  The  subtests  are  arranged  in 
ascending  order  of  difficulty  with  the  final  subtest  involving  coordination  of  three  cursors.  A  detailed 
description  of  the  PMT/DLT  subtests  and  variables  may  be  found  in  Blower  and  Dolgin  (1991)  and  Shull 
(1991).  The  presentation  order,  administration  time,  and  description  of  the  various  components  of  the 
PMT/DLT  are  presented  in  Table  1. 


Table  1.  Sequence,  Description,  and  Administration  Time  of  the  PMT/DLT  Subtests. 


Sequence 

Description  and  variable  name 

Test  time  (min) 
Individual/cumulative 

1. 

Single  psychomotor  task  (PMT),  stick  only  [PMT(S)] 

07 

/ 

07 

2. 

Single  dichotic  listening  task  (DLT) 

16 

/ 

23 

3. 

First  multitask  [PMT(S)/DLT] 

05 

/ 

28 

4. 

Single  (PMT),  stick  and  rodder  [PMT(S  +  R)] 

10 

/ 

38 

5. 

Second  multitask  [PMT1(S  +  R)/DLTJ 

05 

/ 

43 

6. 

Third  multitask  [PMT2(S  +  R)/DLT] 

05 

/ 

48 

7. 

Single  PMT;  stick,  rodder,  and  throttle  [PMT(S  +  R+T)] 

07 

/ 

56 

DATA  ANALYSIS 

Initially,  we  conducted  a  series  of  one-way  analysis  of  variance  (ANOVA)  to  determine  the  differences  of 
sex  and  age  among  the  three  pipeline  subgroups  on  various  variables.  Next,  we  conducted  hieiarchical  and 
stepwise  discriminant  function  analyses  to  characterize  the  relationship  between  pipeline  assignment  and 
AQT/FAR  and  CBPT  variables.  We  conducted  a  series  of  stepwise  analysis  of  the  field  of  CBPT  variables  to 
determine  which  ones  contributed  to  the  prediction  of  pipeline  assignment.  Because  the  PMT/DLT  were  the 
only  CBPT  variables  to  remain  in  the  equation,  we  retained  them  in  the  final  equation.  Our  subjects  were 
selected  for  initial  flight  training  based  on  their  AQT/FAR  test  scores,  so  those  scores  were  entered  into  the 
discriminant  function  before  the  PMT/DLT  variables.  This  procedure  allowed  us  to  estimate  the  amount  of 
variance  unique  to  the  individual  CBPT  variables  after  the  AQT/FAR  variables.  Finally,  we  developed  a  model 
to  predict  jet  versus  other  pipeline  assignment  (HELO  and  patrol).  We  conducted  the  same  discriminant 
function  analyses  described  for  the  three-subgroup  model. 
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RESULTS 


DESCRIPTIVE  STATISTICS 

Descriptive  statistics  of  the  test  performance  for  the  three  subgroups  of  student  naval  aviators  (SNA)  are 
presented  in  Table  2.  Increasing  values  for  the  AQT  and  FAR  indicate  decreased  errors  and  superior 
performance,  while  increasing  values  for  the  various  PMT/DLT  composite  variables  indicate  increased  errors 
and  poorer  test  performance,  As  described  earlier,  the  SNAs  were  divided  into  jet,  HELO,  and  patrol 
subgroups  according  to  what  pipeline  they  were  assigned  to  at  the  end  of  primary  flight  training.  One-way 
ANOVAs  showed  no  significant  differences  for  sex  on  any  of  the  test  variables.  In  addition,  there  were  no 
significant  differences  among  the  SNA  groups  for  age. 


Table  2.  Descriptive  Statistics  of  the  Variables  for  the  Student  Aviator  Subgroups  [Mean  (5D)J.  * 


Variable 

Jet  ( N  -  75) 

HELO  (N  -  91) 

Patrol  (A/  =>  71) 

AQT 

5.95  (1.33) 

5.71  (1.30) 

5.85  (1.24) 

FAR 

7.81  (1.36) 

6.93  (1.77) 

6.79  (1.68) 

PMT(S) 

4.02  (0.22) 

4.18  (0.29) 

4.19  (0.29) 

DLT 

0.74  (0.24) 

0.81  (0.25) 

0.78  (0.25) 

DLT/PMT(S) 

0.87  (0.29) 

0.95  (0.29) 

0.93  (0.36) 

PMT(S)/DLT 

3.55  (0.23) 

3.74  (0.29) 

3.75  (0.32) 

PMT1(S+R) 

4.53  (0.15) 

4.66  (0.22) 

4.65  (0.23) 

PMT1&2(S  +  R)  Mean 

3.94  (0.22) 

4.09  (0.26) 

4.07  (0.23) 

DLT/PMT(S+R) 

0.90  (0.26) 

0.99  (0.24) 

0.97  (0.31) 

PMT(S  +  R+T) 

4.13  (0.15) 

4.28  (0.19) 

4.21  (0.19) 

*  Increasing  magnitude  on  the  PMT/DLT  variables  indicates  increased  tracking  and  listening  response  error. 
The  FAR  and  AQT  are  stanine  scores  and  increasing  magnitude  indicates  superior  performance. 


We  found  significant  differences  among  the  SNA  groups  for  many  of  the  test  variables  using  one-way 
ANOVAs  and  the  Scheffe  post-hoc  test.  Table  3  presents  the  results  for  the  one-way  ANOVAs  for  the  tests 
among  the  SNA  subgroups.  As  shown  in  Table  3,  the  jet  pipeline  SNAs  performed  significantly  better  than 
those  in  the  HELO  and  patrol  pipelines  on  the  FAR.  The  HELO  and  patrol  pipeline  students  also  made 
significantly  more  errors  on  many  of  the  PMT/DLT  tests.  Interestingly,  only  the  tracking  performance 
variables  were  significantly  different  for  the  groups.  The  DLT  component  was  not  significantly  different  for 
any  of  the  three  groups.  Finally,  there  were  no  significant  differences  between  the  HELO  and  patrol  pipeline 
students  on  any  of  the  variables. 
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Table  3.  ANOVAs  and  Between-Group  Comparisons  for  SNA  Pipeline  Subgroups. 


Variable 

F 

(DF) 

P 

Significant  pairwise 

AQT 

0.67 

(2,  234) 

* 

FAR 

8.81 

(2,  234) 

.0002 

jet  >  HELO,  patrol 

PMT(S) 

9.02 

(2,  234) 

.0001 

jet  <  HELO,  patrol 

DLT 

1.44 

(2,  234) 

* 

DLT/PMT(S) 

1.60 

(2,  234) 

PMT(S)/DLT 

12.06 

(2,  234) 

.00001 

jet  <  HELO,  patrol 

PMT1(S+R) 

9.97 

(2,  232) 

.00006 

jet  <  HELO,  patrol 

PMT1&2(S+R)  Mean 

8.75 

(2,  233) 

.0002 

jet  <  HELO,  patrol 

DLT/PMT(S  +  R) 

2.79 

(2,  234) 

■it 

PMT(S+R+T) 

12.78 

(2,  234) 

.00001 

jet  <  HELO,  patrol 

*  not  significant 

Next,  we  conducted  canonical  discriminant  function  (DF)  analyses  to  assess  the  prediction  of  membership  in 
the  jet,  HELO,  and  patrol  SNA  subgroups.  Because  the  students  were  selected  initially  on  the  basis  of  the  AQT 
and  FAR,  we  retained  only  those  variables  in  the  equation.  A  significant  (DF)  was  calculated,  with  a  combined 
X2  (2)  “  16.46,  p  <  .002.  This  indicates  that  there  was  a  statistical  difference  between  the  means  of  the  three 
SNA  subgroups.  The  Pearson  correlation  coefficient  was  R  *  .26.  With  the  probability  for  assignment  set  at 
the  proportions  for  final  actual  assignment,  the  DF  explained  6.8%  of  the  total  variance  and  accurately  classified 
44,7%  of  the  cases.  Table  4  presents  the  classification  matrix  for  the  AQT/FAR  model. 

When  we  entered  the  additional  PMT/DLT  variables,  the  DF  was  also  significant  ( x 2  (6)  -  37. 148,  p  < 
.0002).  A  Pearson  correlation  coefficient  of  .38  was  obtained  with  this  enhanced  model.  Using  the  same 
proportional  probability  for  actual  group  assignment,  the  DF  explained  14.4%  of  tb*  total  variance  and 
accurately  classified  45.7%  of  the  cases.  Table  5  presents  the  DF  for  the  enhanced  model. 

Table  4.  Aviation  Qualification  Test/Flight  Aptitude  Rating  Classification  Matrix.* 


_ Predicted  Group  Membership _ 

Percent 

Actual  Group _ Jgl _ HELP _ Patrol _ Correct 


Jet 

46 

29 

0 

61.3 

HELO 

33 

57 

1 

62.6 

Patrol 

22 

46 

3 

4.2 

*  Percent  of  grouped  cases  correctly  classified:  44.7  % 
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Table  5.  Enhanced  PMT/DLT  Model  Classification  Matrix.* 


Predicted  Group  Membership 


■AgtvaLGrauB- 


JsL 


HELP 


Patrol 


Percent 

Correct 


Jet 

45 

26 

4 

60.0 

HELO 

26 

57 

6 

64.0 

Patrol 

24 

41 

5 

7.1 

*  Percent  of  grouped  cases  correctly  clas  ;  fied:  45.7  % 

Because  the  HELO  and  patrol  pipelines  were  not  significantly  different  on  any  of  the  PMT/DLT  variables, 
we  conducted  one  additional  set  of  analyses  combining  those  two  subgroups.  In  the  resulting  analyses,  we 
attempted  to  discriminate  the  jet  from  the  combined  HELO/patrol  subgroup.  Using  the  savd  procedures 
outlined  for  the  three-subgroup  model,  the  DF  for  the  AQT  and  FAR  was  significant  (x7  (2)  =  16.367,  p  < 
.0002).  The  Pearson  correlation  coefficient  ( R  «  .26)  indicates  that  the  model  explained  6.7%  of  the  variance. 
As  seen  in  Table  6,  with  the  combined  HELO/patrol  subgroup,  the  AQT  and  FAR  model  achieved  a  higher 
number  of  accurate  predictions  than  the  three-subgroup  model  did.  Finally,  we  added  the  PMT/DLT  variables 
identified  in  our  earlier  ANOVAs  to  the  discriminant  function  analyses.  This  enhanced  PMT/DLT  model 
(Table  7)  achieved  significance  (x2  (6)  =  35.354,  p  <  .00001)  and  accounted  for  approximately  14.4%  of  the 
total  variance  ( R  =*  .38).  Of  particular  interest,  the  enhanced  model  accurately  classified  36%  of  the  jet 
pipeline  subgroup  compared  to  4%  for  the  AQT/FAR  model.  Further  review  of  the  information  presented  in 
Tables  6  and  7  indicates  that  the  enhanced  PMT/DLT  model  achieved  the  largest  percentage  of  correct 
classifications. 


Table  6.  Classification  Matrix  for  AQT/FAR  Model  with  HELO  and  Patrol  Subgroups  Combined.* 


_ Predicted  Group  Membership 

Actual  Group 

Jet 

HELO/Patrol 

Percent 

Correct 

Jet 

3 

72 

4.0 

HELO/Patrol 

2 

160 

98.8 

*  Percent  of  grouped  cases  correctly  Classified:  68.8% 
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Table  7.  Classification  Matrix  for  Enhanced  PMT/DLT  Model  with  HELO  and  Patrol  Subgroups  Combined.  * 


_ Predicted  Group  Membership 

Actual  Grouo 

Jet 

Percent 

Correct 

let 

27 

48 

36.0 

HELO/Patrol 

20 

142 

87.7 

*  Percent  of  grouped  cases  correctly  classified:  71.3% 


DISCUSSION 

We  have  demonstrated  that  a  sample  of  student  naval  aviators  who  enter  the  jet  pipeline  nuke  significantly 
fewer  tracking  errors  on  certain  computer-based  tests  than  students  who  enter  HELO  or  maritime  training. 

They  also  perform  significantly  better  on  paper-and-pencil  tests  of  flight  aptitude.  Furthermore,  we  found  that 
these  same  psychological  tests  can  improve  predictions  of  which  aircraft  training  pipeline  student  naval  aviators 
select.  Specifically,  we  found  that  the  FAR  has  validity  as  a  pipeline  predictor,  while  component  variables  of 
the  PMT/DLT  demonstrated  incremental  validity.  Our  results  indicated  that  of  the  pilots  predicted  for 
membership  in  one  of  the  three  training  pipelines,  approximately  71.3  %  were  actually  selected  based  on  an 
enhanced  testing  model.  It  is  important  to  note  that  this  level  of  accuracy  was  achieved  before  the  students  had 
even  entered  ground  school.  Related  studies  have  demonstrated  that  these  sam >  tests  can  improve  predictions  of 
performance  in  aviation  training.  Taken  together,  these  results  suggest  that  optimal  pipeline  assignment  based 
on  selection  tests  would  result  in  improvements  in  training  assignments  and  actual  training  performance.  The 
increases  in  performance  may  be  modest.  However,  Cascio  (1991)  cited  that  even  small  improvements  in 
performance  generally  result  in  cost  savings  to  organizations.  These  cost  savings  could  be  substantial  for  the 
USN  and  USMC. 

We  believe  that  the  results  of  this  investigation  lend  further  empirical  support  for  a  set  of  valid 
psychological  tests  in  predicting  the  flight  training  performance  of  student  naval  aviators.  The  results  are  also 
consistent  with  research  conducted  by  the  U.S.  Army  and  indicate  that  the  computer-based  PMT/DLT  can  be 
used  to  optimally  assign  students  to  rotary-  or  fixed-wing  pilot  training.  This  was  expected.  As  indicated 
earlier  in  this  paper,  the  U.S.  Army  incorporated  the  computer-based  PMT/DLT  developed  at  NAMRL  into  the 
MTTB  in  the  late  1980s.  Even  though  the  test  software  driver  was  modified  to  meet  hardware  needs,  it  is 
essentially  the  same  as  the  current  NAMRL  PMT/DLT.  The  U.S.  Army  MTTB  PMT/DLT  has  become  the 
premier  test  for  optimal  training  assignment  of  U.S.  Army  rotary-wing  pilots. 

Additionally,  the  results  are  consistent  with  U.S.  Air  Force  research  with  the  BAT.  Valid,  broad-spectrum 
computer-based  tests  can  improve  the  prediction  of  flight-training  performance.  They  can  also  optimize  training 
assignments.  This  is  particularly  useful  for  the  naval  services  in  the  prediction  of  whether  students  should  be 
considered  as  candidates  for  the  jet  pipeline.  As  the  naval  services  are  faced  with  increasing  pressure  to  reduce 
costs  and  consolidate  training  resources  with  the  other  services,  a  valid  computer-based  selection  system  would 
improve  selection  and  classification  decisions. 
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